Towards a systematic approach to manual annotation of code smells

Abstract

Code smells are structures in code that may indicate maintainability issues. They are challenging to define, and software engineers detect them differently. One mitigation of this problem could be an AI-based code smell detector. However, to develop it, we need a standardized benchmark dataset. Existing datasets suffer from (1) annotation subjectivity, (2) a lack of ground-truth consensus among annotators, and (3) reproducibility issues. This paper aims to present a systematic manual annotation procedure that addresses these issues. We tailored the prescriptive natural language processing annotation methodology to code smell detection: we cross-validate annotations to mitigate subjectivity, use clear guidelines to reach consensus, and follow literature recommendations for reproducibility, open-sourcing our tools [...] extracted from existing empirical research. The annotators refined their understanding of the task through a proof-of-concept annotation encompassing retrospective discussion and disagreement resolution, and then performed the full annotation. We confirmed that consensus was reached by measuring annotation consistency. Our contributions are the proposed annotation procedure, a novel dataset of code smells in C# projects, the annotators' experience report, and an open-sourced supporting tool.
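The abstract does not say which consistency measure was used. As an illustration only, the following minimal sketch assumes two annotators and Cohen's kappa, a common chance-corrected agreement statistic. The function name `cohen_kappa` and the label values are hypothetical and are not taken from the paper.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same items.

    Assumption: one categorical label per item and per annotator.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both annotators must label the same non-empty item list")

    n = len(labels_a)

    # Observed agreement: share of items on which both annotators agree.
    p_observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Expected agreement by chance, from each annotator's label frequencies.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    categories = count_a.keys() | count_b.keys()
    p_expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)

    if p_expected == 1.0:
        # Both annotators used a single identical label for every item.
        return 1.0
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical labels for eight code snippets (not data from the paper).
annotator_1 = [0, 1, 1, 0, 2, 0, 1, 0]
annotator_2 = [0, 1, 0, 0, 2, 0, 1, 1]
kappa = cohen_kappa(annotator_1, annotator_2)
print(f"Cohen's kappa: {kappa:.3f}")
```

A value near 1 indicates agreement well above chance, while a value near 0 indicates agreement no better than chance. With more than two annotators, a statistic such as Fleiss' kappa or Krippendorff's alpha would be a more natural choice.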

Similar resources

A New Approach of Visualizing Code Smells

Code smells are indicators of design flaws that may have negative effects on the comprehensibility and maintainability of a software system. Previous studies on software clones have shown that visualization of findings is useful to identify clones suitable for refactorings. However, techniques to visualize code smells in general are rare and, thus, an interesting field of research to bridge th...

Towards a Catalogue of Refactorings and Code Smells for AspectJ

In this paper, we contribute to the characterisation of a programming style specific to aspect-oriented programming. For this purpose, we present a collection of refactorings for aspect-oriented source code, comprising refactorings to enable extraction to aspects of crosscutting concerns from object-oriented legacy code, the subsequent tidying up of the extracted aspects and factoring out of com...

Code smells

s, titles and keywords were analysed by VOSviewer using default parameters. All common terms like study, baseline, control group, trend, method were excluded from the analysis. Three maps were induced (1) clustered landscapes presenting popularity of terms (more popular terms are presented in larger squares), associations between terms (terms located near each other are strongly associated) and r...

How Do Community Smells Influence Code Smells?

Code smells reflect sub-optimal patterns of code that often lead to critical software flaws or failure. In the same way, community smells reflect sub-optimal organisational and socio-technical patterns in the organisational structure of the software community. To understand the relation between the community smells and code smells we start by surveying 162 developers of nine open-source systems. ...

From Linguistics to Literature: A Linguistic Approach to the Study of Linguistic Deviations in the Turkish Divan of Shahriar

Chapter I provides an overview of structural linguistics and touches upon the Saussurean dichotomies with the final goal of exploring their relevance to the stylistic studies of literature. To provide evidence for the significance of the study, Chapter II deals with the controversial issue of linguistics and literature, and presents opposing views which, at the same time, have been central to t...

Journal

Journal title: Science of Computer Programming

Year: 2023

ISSN: 1872-7964, 0167-6423

DOI: https://doi.org/10.1016/j.scico.2023.102999